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SUPERANTIGENS 

TECHNICAL FIELD 

5 This invention relates to superantigens, and to their use, including in diagnosis 
and/or treatment of disease. 

BACKGROUND ART 

10 Bacterial superantigens are the most potent T cell mitogens known. They stimulate 
large numbers of T cells by directly binding to the side of the MHC class II and'T cell 
Receptor (TcR) molecules. Because they override the normally exquisite MHC 
restriction phenomenon of T cell antigen recognition, they are prime candidates for 
either causing the onset of autoimmune diseases or exacerbating an existing 

15 autoimmune disorder. 

The applicants have identified genes coding for four novel superantigens from S. 
pyogenes. It is broadly to these superantigens and polynucleotides encoding them 
that the present invention is directed. 

20 

SUMMARY OF THE INVENTION 

In one aspect the invention provides a superantigen selected from any one of SMEZ- 
2, SPE-G, SPE-H and SPE-J, or a functionally equivalent variant thereof. 

25 

In a further aspect the invention provides a polynucleotide molecule comprising a 
sequence encoding a superantigen chosen from SMEZ-2, SPE-G, SPE-H, SPE-J, or a 
functionally equivalent variant thereof. 

30 In another aspect of the invention there is provided a method of subtyping 
Streptococci on the basis of superantigen genotype comprising detection of the 
presence of any or all of the above four superantigens or the corresponding genes. 



35 



In a further aspect the invention provides a construct comprising any of the above 
superantigens (or superantigen variants) bound to a cell- targe ting molecule. 
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In yet a further aspect, the invention provides a pharmaceutical composition for 
therapy or prophylaxis comprising a superantigen or superantigen variant as 
described above linked to cell targeting molecule. 

5 

Other aspects of the invention will be apparent from the description provided below. 

DESCRIPTION OF DRAWINGS 

10 While the invention is broadly defined above, it further includes embodiments of 
which the following description provides examples. It will also be better understood 
with reference to the following drawings: 

Fig 1: Multiple alignment of superantigen protein sequences. 

15 

The protein sequence of mature toxins were aligned using the PileUp programme on 
the GCG package Regions of high sequence identity are in black boxes. The boxes 
below the sequences indicate the structural elements of SPE-C, as determined from 
the crystal structure (Roussel et al 1997 Nat. Struct. Biol. 4 no8:635-43). Regions 
20 with highest homology correspond to the 04, P5, <x4 and oc5 regions in SPE-C. The 
clear box near the C-terminus represents a primary zinc binding motif, a common 
feature of all toxins shown. The arrows on top of the sequence alignment show the 
regions of sequence diversity between SMEZ and SMEZ-2. 

25 Figure 2: The nucleotide sequence of the portion of the smez-2 gene coding the 
mature SMEZ-2 superantigen. 

Figure 3: Gel electrophoresis of the purified recombinant toxins. 

30 A. Two micrograms of purified recombinant toxin were run on a 12.5% SDS- 
polyacrylamide gel to show the purity of the preparations; B. Two micrograms of 
purified recombinant toxin were run on an isoelectric focusing gel (5.5% PAA, pH 5- 
8). The isoelectric point (IEP) of rSMEZ-2, rSPE-G and rSPE-H is similar and was 
estimated at pH 7-8. The IEP of rSMEZ was estimated at pH 6-6.5. 
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Figure 4: Stimulation of human T cells with recombinant toxins. 

PBLs were isolated from human blood samples and incubated with varying 
concentrations of recombinant toxin. After 3d, 0. 1 ^iCi [ 3 H]-tiiymidine was added 
5 and cells were incubated for another 24h, before harvested and counted on a 
gamma counter. O, unstimulated; A, rSMEZ; □, rSMEZ-2; rSPE-G; B ; rSPE-H. 

Figure 5: Jurkat cell assay 

10 Jurkat cells (bearing a Vp8 TcR) and LG-2 cells were mixed with varying 
concentrations of recombinant toxin and incubated for 24h, before Sel cells were 
added. After Id, 0.1 |j.Ci [ 3 H]-thymidine was added and cells were counted after 
another 24h. The Vp8 targeting SEE was used as a positive control. The negative 
control was SEA. Both SMEZ and SMEZ-2 were potent stimulators of Jurkat cells, 

15 indicating their ability to specifically target Vp8 bearing T cells. O, unstimulated; 
A, rSEA; □, rSEE; ♦ , rSMEZ; ■, rSMEZ-2. 

Figure 6: Zinc dependent binding of SMEZ-2 to LG-2 cells 

20 LG-2 cells were incubated in duplicates with 1 ng of 125 I labelled rSMEZ-2 and 
increasing amounts of unlabeled toxin at 37°C for lh, and then the cells were 
washed and counted. 

O, incubation in media; A, incubation in media plus ImM EDTA; □, incubation in 
25 media plus 1 mM EDTA, 2 mM ZnCl 2 . 

Figure 7: Scatchard analysis of SMEZ-2 binding to LG-2 cells 

One nanogram 125 I-labeled rSMEZ-2 was incubated in duplicates with LG-2 cells 
30 and a 2-fold dilution series of cold toxin (10 *ig to 10 pg). After lh, cells were 
washed and counted. Scatchard plots were performed as described by Cunningham 
et al 1989 Science 243:1330-1336. 
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Figure 8: Summary of competitive binding experiments. 



Efficiency of each labelled toxin to compete with a 10,000-fold molar excess of any 



The results within the boxes are at the bottom right have previously been published 
(Li et al. 1997). 

Figure 9: Competition binding study with SMEZ-2. 

10 

LG-2 cells were incubated in duplicates with 1 ng of 125 I-labeled rSMEZ-2 and 
increasing amounts of unlabeled rSMEZ-2, rSEA, rSEB, rTSST or rSPE-C. After lh 
cells were washed and counted. 

O, rSMEZ-2; rSEA; □, rSEB; iTSST; rSPE-C. 

15 

DESCRIPTION OF THE INVENTION 

The focus of the invention is the identification of four novel superantigens (SPE-G, 
SPE-H, SPE-J and SMEZ-2) and the corresponding polynucleotides which encode 
20 them. 

Figure 1 shows the amino acid sequences of the above four superantigens together 
with those of previously identified superantigens SMEZ, SPE-C and SEA. 

25 Of the four superantigens SPE-G, SPE-H, SPE-J and SMEZ-2, the latter is of 
greatest interest. 

The smez-2 gene which encodes SMEZ-2 was identified in an experiment designed 
to produce recombinant SMEZ protein from S. pyogenes 2035 genomic DNA. A full 
30 length smez gene was isolated from the strain but the DNA sequence of the smez 
gene of strain 2035 showed nucleotide changes in 36 positions (or 5%) compared to 
smez from strain Ml (Fig. 1). The deduced protein sequences differed in 17 amino 
acid residues (or 8.1%). This difference establishes this as a new gene, smez-2, and 
the encoded protein as a new superantigen, SMEZ-2. 




-2 cells. I — I , no competition; III, 25% 
75% competition; ■ , 100% competition. 
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The most significant difference between SMEZ and SMEZ-2 is an exchanged 
pentapeptide sequence at position 96-100, where the EEPMS sequence of SMEZ is 
converted to KTSIL in SMEZ-2 (Fig. 1). A second difference is at position 111-112, 
where an RR dipeptide is exchanged for GK in SMEZ-2. The remaining 10 different 
residues are spread over almost the entire primary sequence. 

Figure 2 shows the nucleotide sequence encoding mature SMEZ-2 and the deduced 
amino acid sequence. 

The invention is of course not restricted to superantigens/polynucleotides having 
the specific sequences of Figures 1 and 2. Instead, functionally equivalent variants 
are contemplated. 

The phrase "functionally equivalent variants 1 * recognises that it is possible to vary 
the amino acid/nucleotide sequence of a protein while retaining substantially 
equivalent functionality. For example, a protein can be considered a functional 
equivalent of another protein for a specific function if the equivalent protein is 
immunologically cross-reactive with and has at least substantially the same 
function as the original protein. The equivalent can be, for example, a fragment of 
the protein, a fusion of the protein with another protein or carrier, or a fusion of a 
fragment which additional amino acids. For example, it is possible to substitute 
amino acids in a sequence with equivalent amino acids using conventional 
techniques. Groups of amino acids normally held to be equivalent are: 

(a) Ala, Ser, Thr, Pro, Gly; 

(b) Asn, Asp, Glu, Gin; 

(c) His, Arg, Lys; 

(d) Met, Leu, He, Val; and 

(e) Phe, Tyr, Trp. 

Equally, DNA sequences encoding a particular product can vary significantly simply 
due to the degeneracy of the nucleic acid code. 



The probability of one sequence being functionally equivalent to another can be 
measured by the computer algorithms BLASTP (Altschul et al 1990 J Mol Biol 



t 

215:403-410) and FASTA (Pearson et al 1988 Proc Natl Acad Sci 85 2444-2448) for 
proteins and DNA respectively. 

Identification of these superantigens and of their properties gives rise to a number 
5 of useful applications. A first such application is in the genotyping of organisms by 
reference to their superantigen profile. 

An illustration of this is subtyping of strains of S. pyogenes. 

One feature which has been observed is that all clones of S. pyogenes so far found 
to be positive for SMEZ express either SMEZ- 1 or SMEZ-2 but not both. Thus they 
are mutually exclusive, which enables a rapid diagnostic test which tells whether an 
isolate or a patient sample is either SMEZ-1 +ve or SMEZ-2 +ve. This will assist in 
the typing of the isolate. 

This general diagnostic approach is most simply achieved by providing a set or 
primers which amplify either all or a subset of superantigen genes and that 
generate gene specific fragments. This can be modified to provide a simple 
qualitative ELISA-strip type kit that detects biotin labelled PCR fragments amplified 
by the specific primers and hybridised to immobilised sequence specific probes. 
This has usefulness for screening patient tissue samples for the presence of 
superantigen producing streptococcal strains. 

Another approach is to provide monoclonal antibodies to detect each of the 
25 streptococcal superantigens. An ELISA kit containing such antibodies would allow 
the screening of large numbers of streptococcal isolates. A kit such as this would be 
useful for agencies testing for patterns in streptococcal disease or food poisoning 
outbreaks. 

30 Another potential diagnostic application of the superantigens of the invention is in 
the diagnosis of disease, such as Kawasaki Syndrome (KS). 

KS is an acute multi-system vasculitis of unknown aetiology. It occurs world-wide 
but is most prevalent in Japan or in Japanese ancestry. It primarily affects infants 
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and the young up to the age of 16. It is an acute disease that without treatment, 
can be fatal. Primary clinical manifestations include 
Prolonged fever 

Bilateral non-exudative conjunctivitis 
Indurtation and erythema of the extremities 
Inflammation of the lips and oropharynx 
Polymorphous skin rash 
Cervical lymphoadenopathy 

In 15-25% of cases, coronary arterial lesions develop. 



These indications are used as a primary diagnosis of KS. 



15 



In Japan and the US, KS has become one of the most common causes of acquired 
heart disease in children. Treatment involves the immediate intravenous 
administration of gamma globulin (IVGG) during the acute phase of the disease and 
this significantly reduces the level of coronary lesions. 



20 



25 



There are two clear phases to the disease, an acute phase and a convalescent 
phase. The acute phase is marked by strong immune activation. Several reports 
have suggested that superantigens are involved and many attempts have been 
made to link the disease to infection with superantigen producing strains of 
Streptococcus pyogenes. Features of the acute phase of KS are the expansion of VP 
2 and to a lesser extent VP8 bearing T cells and an increase of DR expression T cells 
(a hallmark of T cell activation). 

Because SMEZ-2 stimulates both VP2 and V08 bearing T cells, testing for SMEZ-2 
production is potentially very useful in the diagnosis of KS. 



30 



In addition to diagnostic applications, another application of the superantigens is 
reliant upon their ability to bind to other cells. 



35 



One of the most important features of superantigens is that they bind a large 
number or T cell receptor molecules by binding to the Vp domain. They are the 
most potent of all T cell mitogens and are therefore useful to recruit and activate T 
cells in a relatively non-specific fashion. 
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This ability enables the formation of constructs in which the superantigen (or at 
least the T-cell binding portion of it) is coupled to a cell-targeting molecule, 
particularly an antibody, more usually a monoclonal antibody. 

5 

When a monoclonal antibody that targets a specific cell surface antigen (such as a 
tumor specific antigen) is coupled to a superantigen in such a construct, this 
generates a reagent that on the one hand will bind specifically to the tumor cell, and 
on the other hand recruit and selectively active T cells for the purpose of killing the 
10 targeted cell. 

Bi-specific constructs of this type have important applications in therapy 
(particularly cancer therapy) and may be prepared by means known to those skilled 
in aft. For example SMEZ-2 may be coupled to a tumor specific monoclonal 
15 antibody. The constructs may be incorporated into conventional carriers for 
pharmaceutically-active proteins. 

Various aspects of the invention will now be described with reference to the 
following experimental section, which is included for illustrative purposes. 

20 

EXAMPLE 

Materials and Methods 

2 5 Identification of novel SA Gs 

The novel superantigens were identified by searching the S. pyogenes Ml genome 
database at the University of Oklahoma (http://www.genome.ou.edu/strep.html) 
with highly conserved P5 and <x4regions of streptococcal and staphylococcal 
superantigens, using a TBlastN search programme. 

30 

The open reading frames were defined by translating the DNA sequences around the 
matching regions and aligning the protein sequences to known superantigens using 
the computer programes Gap. Multiple alignments and dendrograms were 
performed with Lineup and Pileup. The FASTA programme was used for searching 
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the SwissProt (Amos Bairoch, Switzerland) and PIR (Protein Identification Resource, 
USA) protein databases. _ 

The leader sequences of SPE-G and SPE-H were predicted using the SP Scan 
5 programme All computer programmes are part of the GCG package (version 8). 

Cloning of smez, smez-2, spe-g and spe-h 

Fifty nanograms of S.pyogenes Ml (ATCC 700294) or S.pyogenes 2035 genomic DNA 
was used as a template to amplify the smez DNA fragment and the smez-2 DNA 

10 fragment, respectively, by PCR using the primers smez-fw 
(TGGG ATC CTT AG AAGT AG ATAAT A) and smez-rev (AAGAATTCTTAGGAGTCAATTTC) 
and Taq Polymerase (Promega). The primers contain a terminal tag with the 
restriction enzyme recognition sequences BamHI and EcoRI, respectively. The 
amplified DNA fragment, encoding the mature protein without the leader sequence 

15 (Kamezawa et al, 1997 Infect. Immun. 65 no9:38281-33) was cloned into a T-tailed 
pBlueScript SKII vector (Stratagene). 

Spe-g and spe-h were cloned in a similiar approach, using the primers spe-g-fw 
(CTGG ATC C G ATG AAAATTTAAAAG ATTTAA) and spe-g-rev 
20 (AAGAATTCGGGGGGAGAATAG), and primers spe-h-fw 
(TTGG ATC C AATTCTTATAATAC AAC C) and spe-h-rev 
( AAAAGCTTTTAGCTGATTG AC AC) , respectively. 

The DNA sequences of the subcloned toxin genes were confirmed by the dideoxy 
25 chain termination method using a Licor automated DNA sequencer. As the DNA 
sequences from the genomic database are all unedited raw data, 3 subclones of 
every cloning experiment were analyzed to ensure that no Taq polymerase related 
mutations were introduced. 

30 Expression and purification ofrSMEZ, r SMEZ-2, rSPE-G and rSPE-H. 

Subcloned smez, smez-2 and spe-g fragments were cut from pBlueScript SKII 
vectors, using restriction enzymes BamHI and EcoRI (LifeTech), and cloned into 
pGEX-2T expression vectors (Pharmacia). Due to an internal EcoRI restriction site 
within the spe-H gene, the pBlueScript: spe-h subclone was digested with BamHI 
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and Hindlll and the spe-h fragment was cloned into a modified pGEX-2T vector that 
contains a Hindlll 3 'cloning site. 

Recombinant SMEZ, rSMEZ-2 and rSPE-H were expressed in E.coli DH5a cells as 
5 glutathione-S-tranferase (GST) fusion proteins. Cultures were grown at 37° C and 
induced for 3-4 h after adding 0.2 mM isopropyl-P-D-thiogalactopyranoside (IPTG). 
GST - SPE-G fusion protein was expressed in cells grown at 28° C. 

The GST fusion proteins were purified on glutathione agarose as described 
10 previously (Li et al, 1997) and the mature toxins were cleaved off from GST by 
trypsin digestion. All recombinant toxins, except rSMEZ, were further purified by 
two rounds of cation exchange chromatography using caxboxy methyl sepharose 
(Pharmacia). The GST-SMEZ fusion protein was trypsin digested on the GSH-column 
and the flow through containing the SMEZ was collected. 

15 

Gel electrophoresis 

All purified recombinant toxins were tested on a 12% SDS-polyacrylamide gel 
according the procedure of Laemmli. The isoelectric point of the recombinant toxins 
was determined by isoelectric focusing on a 5.5% polyacrylamide gel using 
20 ampholine pH 5-8 (Pharmacia Biotech). The gel was run for 90 min at 1 W constant 
power. 

Toxin proliferation assay 

Human peripheral blood lymphocytes (PBL) were purified from blood of a healthy 
25 donor by Histopaque Ficoll (Sigma) fractionation. The PBL were incubated in 96-well 
round bottom micro titer plates at 10 5 cells per well with RPMI-10 (RPMI with 10% 
fetal calf serum) containing varying dilutions of recombinant toxins. The dilution 
series was performed in 1:5 steps from a starting concentration of 10 ng/ml of toxin. 
Pipette tips were changed after each dilution step. After 3 days 0. 1 |j,Ci 
30 [ 3 H]thymidine was added to each well and cells were incubated for another 24 h. 
Cells were harvested and counted on a scintillation counter. 




Mouse leukocytes were obtained from spleens of 5 different mouse strains (SJL, 
B10.M, B10/ J, C3H and BALB/C). Splenocytes were washed in D MEM- 10, counted 
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in 5% acetic acid and incubated on micro titer plates at 10 5 cells per well with 
DMEM-10 and toxins as described for human PBLs. , 
TcR VP analysis. 

5 VP enrichment analysis was performed by anchored multiprimer amplification 
(Hudson et al, 1993, J exp Med 177:175-185). Human PBLs were incubated with 20 
pg/ml of recombinant toxin at 10 6 cells/ml for 3 d. A two-fold volume expansion of 
the culture followed with medium containing 20 ng/ml IL-2. After another 24h, 
stimulated and resting cells were harvested and RNA was prepared using Trizol 

10 reagent (Life Tech). A 500 bp p-chain DNA probe was obtained by anchored 
multiprimer PCR as described previously (38), radiolabeled and hybridized to del (36) 
individual VPs and a CP DNA region dot blotted on a Nylon membrane. The 
membrane was analysed on a Molecular Dynamics Storm Phosphor imager using 
ImageQuant software. Individual VPs were expressed as a percentage of all the VPs 

15 determined by hybridization to the CP probe. 

Jurkat cell assay 

Jurkat cells (a human T cell line) and LG-2 cells (a human B lymphoblastoid cell 
line, homozygous for HLA-DR1) were harvested in log phase and resuspended in 

20 RPMI-10. One hundred microliter of the cell suspension, containing 1x10 s Jurkat 
cells and 2xl0 4 XG-2 cells were mixed with 100 \j! of varying .dilutions of 
recombinant toxins on 96 well plates. After incubating overnight at 37° C, 100 
aliquots were transfered onto a fresh plate and 100 (ol (lxlO 4 ) of Sel cells (IL-2 
dependent murine T cell line) per well were added. After incubating for 24 h, 0. 1 nCi 

25 [ 3 H]thymidine was added to each well and cells were incubated for another 24 h. 
Cells were harvested and counted on a scintillation counter. As a control, a dilution 
series of IL-2 was incubated with Sel cells. 

Computer aided modelling of protein structures 

30 Protein structures of SMEZ2, SPE-G and SPE-H were created on a Silicon Graphics 
computer using Insightll/ Homology software. The superantigens SEA, SEB and 
SPE-C were used as reference proteins to determine structurally conserved regions 
(SCRs). Coordinate files for SEA (1ESF), for SEB (1SEB) and for SPE-C (1AN8) were 
downloaded from the Brookhaven Protein Database. The primary amino acid 

35 sequences of the reference proteins and SMEZ2, SPE-G and SPE-H, respectively, 
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were aligned and coordinates from superimposed SCR's were assigned to the model 
proteins. The loop regions between the SCRs were generated. by random choice. 
MolScript software (PJ Kraulis, 1991, J App Critallography 24:946-50) was used for 
displaying the computer generated images. 

Radiolabeling and LCr-2 binding experiments 

Recombinant toxin was radioiodinated by the chloramine T method as previously 
described (by Li et al. 1997). Labeled toxin was seperated from free iodine by size 
exclusion chromatography using Sephadex G25 (Pharmacia). LG2 cells were used 
for cell binding experiments, as described (Li et al. 1997). Briefly, cells were 
harvested, resuspended in RPMI-10 and mixed at 10 6 cells/ml with 125 I-tracer toxin 
(1 ng) and 0.0001 to 10 pg of unlabeled toxin and incubated at 37° C for 1 h. After 
washing with ice cold RPMI- 1 the pelleted cells were analyzed in a gamma counter. 
For zinc binding assays the toxins were incubated in either RPMI-10 alone, in RPMI- 
10 with 1 mM EDTA or in RPMI- 10 with ImM EDTA, 2 mM ZnCl 2 . 

Scatchard analysis was performed as described by Cunningham et al. 1989. 
For competitive binding studies, 1 ng of 125 I-tracer toxin (rSMEZ, rSMES-2, rSPE-G, 
rSPE-H, rSEA, rSPE-C, or iTSST) was incubated with 0.0001 to 10 j^g of unlabeled 
20 toxin (rSMEZ, rSMES-2, rSPE-G, rSPE-H, rSEA, rSEB, rSPE-C, and rTSST) for lh. 
For SEB inhibition studies, 20 ng of 125 I-rSEB was used as tracer and samples were 
incubated for 4h. 

Results 

25 

Identification and sequence analysis of superantigens. 

The Oklahoma University Streptococcus pyogenes Ml genome database is accessible 
via the internet and contains a collection of more than 300 DNA sequence contigs 
30 derived from a shot gun plasmid library of the complete S. pyogenes Ml genome. 
The currently available DNA sequences cover about 95% of the total genome. 
We searched this database with a highly conserved superantigen peptide sequence, 
using a search program that screens the DNA database for peptide sequences in all 
6 possible reading frames. We found 8 significant matches and predicted the open 
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reading frames (ORFs) by aligning translated DNA sequences to complete protein 
sequences of known SAgs. 

Five matches gave complete ORFs with significant homology to streptococcal and 
5 staphylococcal superantigens. Three of these ORFs correlate to SPE-C, SSA and the 
recently described SMEZ (Kamezawa et al. 1997), respectively. The remaining two 
ORFs could not be correlated to any known protein in the SwissProt and PIR 
databases. We named these novel putative superantigen genes spe-g and spe-h. 
One ORF could not be generated completely due to its location close to the end of a 
10 contig. The DNA sequence of the missing 5 '-end is located on another contig, and 
individual contigs have yet to be be assembled in the database. However, the 
available sequence shows an ORF for the 137 COOH-terminal amino acid residues 
of a putative novel superantigen which could not be found in the existing protein 
databases. We named this so far incomplete gene spe-j. 

15 

In two cases a complete ORF could not be defined due to several out-of-frame 
mutations. Although DNA sequencing errors on the unedited DNA sequences can't 
be completely ruled out, the high frequency of inserts and deletions probably 
represent natural mutation events on pseudogenes, which are no longer used. 

20 

To produce recombinant proteins of SMEZ, SPE-G and SPE-H, individual genes 
(coding for the mature toxins without leader sequence) were amplified by PCR, and 
subcloned for DNA sequencing. Both, Str. pyogenes Ml and Str. pyogenes 2035 
genomic DNA were used and individual toxin gene sequences compared between the 

25 two strains. The spe-h gene was isolated from Ml strain, but could not be amplified 
from strain 2035 genomic DNA suggesting a restricted strain specificity for this 
toxin. The spe-g gene was cloned from both Ml and 2035, and DNA sequence 
analysis of both genes showed no differences. The full length smez gene was isolated 
from both strains, but DNA sequence comparison revealed some striking 

30 differences. The smez gene of strain 2035 showed nucleotide changes in 36 
positions (or 5%) compared to smez from strain Ml (Fig. 1). The deduced protein 
sequences differed in 17 amino acid residues (or 8.1%). This difference was 
sufficient to indicate a new gene. We named this gene smez-2, because it is 95% 
homologous to smez. 



35 
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The most significant difference between SMEZ and SMEZ-2 is an exchanged 
pentapeptide sequence at position 96-100, where the EEPMS sequence of SMEZ is 
converted to KTSIL in SMEZ2 (Fig. 1). A second cluster is at position 1 1 1-1 12, where 
an RR dipeptide is exchanged for GK in SMEZ-2. The remaining 10 different 
5 residues are spread over almost the entire primary sequence. 

A revised superantigen family tree, based on primary amino acid sequence 
homology now shows 3 general subfamilies; group A comprises SPE-C, SPE-J, SPE- 
G, SMEZ and SMEZ-2, group B comprises SEC 1-3, SEB, SSA, SPE-A and SEG and 
10 group C comprises SEA, SEE, SED, SEH and SEL Two superantigens, TSST and 
SPE-H do not belong to any one of those subfamilies. 

SMEZ, SMEZ-2, SPE-G and SPE-J are most closely related to SPE-C, increasing the 
number of this subfamily from 2 to 5 members. SPE-G shows the highest protein 
15 sequence homology with SPE-C (38.4% identity and 46.6% similarity). The 
homology of SPE-J to SPE-C is even more significant (56% identity and 62% 
similarity), but this comparison is only preliminary due to the missing NH2-terminal 
sequence. SMEZ shows 30.9% / 40.7% homology to SPE-C and SMEZ-2 is 92% / 
93% homologous to SMEZ. 

20 

SPE-H builds a new branch in the family tree and is most closely related to SED, 
showing 25% identity and 37.3% similarity. 

Multiple alignment of SAg protein sequences (Fig. 1) shows that similarities are 
25 clustered within structure determining regions, represented by a4, a5, 04 and P5 
regions. This applies to all toxins of the superantigen family (data not shown) and 
explains why superantigens like SPE-C and SEA have very similar overall structures 
despite their rather low sequence identity of 24.4 %. 

30 Although SPE-H is less related to SPE-C it shows 2 common features with the a SPE- 
C subfamily": (I) a truncated NH2- terminus, lacking the al region and (II) a primary 
zinc binding motif (H-X-D) at the C-terminus (Fig. 1). It has been shown for several 
superantigens that this motif is involved in a zinc coordinated binding to the (3-chain 
of HLA-DR1. 
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Fusion proteins of GST-SMEZ, GST-SMEZ-2 and GST-SPE-H were completely 
soluble and gave yields of about 30 mg per liter. The GST-SPE-G fusion was 
insoluble when grown at 37° C, but mostly soluble when expressed in cells growing 
at 28° C. Although soluble GST-SPE-G yields were 20-30 mg per liter, solubility 
5 decreased after cleavage of the fusion protein with trypsin. Soluble rSPE-G was 
achieved by diluting the GST-SPE-G to less than 0.2 mg/ml prior to cleavage. After 
cation exchange chromatography, purified rSPE-G could be stored at about 0.4 
mg/ml. 

10 Recombinant SMEZ could not be separated from GST by ion exchange 
chromatography. Isoelectric focusing revealed that the isoelectric points of the two 
proteins are too similar to allow separation (data not shown). Therefore, rSMEZ was 
released from GST by cleaving with trypsin while still bound to the GSH agarose 
column. Recombinant SMEZ was collected with the flow through. 

15 

The purified recombinant toxins were applied to SDS-PAGE and isoelectric focusing 
(Fig. 3). Each toxin ran as a single band on the SDS PAA gel confirming their purity 
and their calculated molecular weights of 24.33 (SMEZ), 24.15 (SMEZ-2), 24.63 
(SPE-G) and 23.63 (SPE-H) (Fig. 3 A). The isoelectric focusing gel (Fig. 3B) shows a 
20 significant difference between rSMEZ and rSMEZ-2. Like most other staphylococcal 
and streptococcal toxins, rSMEZ-2 possesses a slightly basic isoelectric point at pH 
7-8, but rSMEZ is acidic with an IEP at pH 6-6.5. 

T cell proliferation and VJ3 specificity 

25 To ensure the native conformation of the purified recombinant toxins, a standard 
[ 3 H]thymidine incorporation assay was performed to test for their potency to 
stimulate peripheral blood lymphocytes (PBLs). All toxins were active on human T 
cells (Fig. 4). Recombinant SEA, rSEB, rSPE-C and rTSST were included as 
reference proteins. The mitogenic potency of these toxins was lower than described 

30 previously, but is regarded as a more accurate figure. In previous studies performed 
by us, a higher starting concentration of toxin (100 ng/ml) was used and tips were 
not changed in between dilutions. This led to significant carryover across the whole 
dilution range. In this study, the starting concentration was 10 ng/ml and tips were 
changed in between dilutions preventing any carryover. 
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The half maximal response for rSPE-G and rSPE-H was 2 pg/ml and 50 pg/ml, 
respectively. No activity was detected at less than 0.02 pg/ml and 0.1 pg/ml, 
respectively. Both toxins are therefore less potent than rSPE-C. Recombinant SMEZ 
was similar in potency to rSPE-C, with a Pso% value of 0.08 pg/ml and no detectable 
proliferation at less than 0.5 fg/ml. Recombinant SMEZ-2 showed the strongest 
mitogenic potency of all toxins tested or, as far as we are aware of, described 
elsewhere. The Pso% value was determined at 0.02 pg/ml and rSMEZ-2 was still 
active at less than 0. 1 pg/ml. All Pso% values are summarized in Table 1. 




• m 



TABLE 1 



POTENCY OF RECOMBINANT TOXINS ON HUMAN AND MOUSE T CELLS. 



PROLIFERATION POTENTIAL Pso% [pg/ml] 



TOXIN 


HUMAN 


SJL 


B10.M 


B10/J 


C3H 


BALB/C 


SEA 


0.1 


20 


12 


1.8 


19 


1000 


SEE 


0.2 


10 


12 


1.5 


50 


15 


SEB 


0.8 


7000 


80,000 


5000 


10,000 


1000 


TSST 


0.2 


20 


1000 


1.2 


100 


10 


SPE-C 


0.1 


> 100,000 


> 100,000 


> 100,000 


> 100,000 


> 100,000 


SMEZ 


0.08 


80 


80 


100 


9000 


200 


SMEZ-2 


0.02 


100 


15 


10 


800 


18 


SPE-G 


2 


> 100,000 


> 100,000 


> 100,000 


. > 100,000 


>100,000 


SPE-H 


50 


15 


800 


5000 


100 


1000 



Human PBLs and mouse T cells were stimulated with varying amounts of recombinant toxin. The Pso* value 
reflects the concentration of recombinant toxin required to induce 50% maximal cell proliferation. No 
proliferation was detected for rSPE-C and rSPE-G at any concentration tested on murine T cells. 
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Murine T cells from 5 different mouse strains were tested for their mitogenic 
response to rSMEZ, rSMEZ-2, rSPE-G and rSPE-H (Table 1). Recombinant SPE-G 
showed no activity against any of the mouse strains tested. Recombinant SPE-H, 
rSMEZ and rSMEZ-2 showed varied potency depending on the individual mouse 
5 strain. For example, rSMEZ-2 was 500-fold more potent than rSPE-H in the BIO/ J 
strain, while rSPE-H was 7.5-fold more active than rSMEZ-2 in the SJL strain. 

The most consistently potent toxin on murine T cells was rSMEZ-2 with Pso% values 
of 10 pg/ml in BIO/ J and 800 pg/ml in C3H. Recombinant SMEZ varied between 80 
10 pg/ml in SJL and B10.M and 9000 pg/ml in C3H. The Pso% value for rSPE-H was 
between 15 pg/ml in SJL and 5000 pg/ml in B10/ J. 




TABLE 2 





vp SPECIFICITY OF RECOMBINANT TOXINS ON HUMAN PBLS 




PERCENT VP ENRICHMENT 


vp 


Resting 


SMEZ 


SMEZ-2 


SPE-G 


SPE-H 


1.1 


0.2 


0.3 


0.4 


1.2 


1 


2.1 


0.4 


8.4 


1 


17.9 


8.6 


3.2 


4.8 


3.1 


2.5 


3 


2.4 


4.1 


3.5 


24.8 


14.4 


11.2 


5.J2 


5.1 


6.2 


1.4 


2.5 


5.7 


2.2 


5.3 


5.6 


2.2 


4.1 


4.7 


4.1 


6.3 


3 


0.8 


2.3 


4.7 


3.5 


6.4 


5.4 


2.1 


5.9 


9.6 


5.6 


6.9 


6.9 


3.5 


9.3 


19.1 


12.2 


7.3 


3.5 


15.3 


7.3 


3.2 


12.6 


7.4 


9 


13.5 


11.7 


2.9 


6.3 


Q 1 

O. 1 , 


O. 1 


on v 


^6 

uy 


4.5 


2.4 


9.1 


0.3 


0.05 


0 


1.2 


2.3 


12.3 


0.8 


1.6 


2 


3.2 


2.6 


12.5 


3 


1.2 


2 


3 


2.3 


15.1 


0.6 


0.5 


0.7 


1.2 


0.8 


23.1 


0.2 


0.1 


0.3 


0.8 


1 


total 


62.1 


99.7 


102.8 


97.1 


75.2 



Human PBLs were incubated with 20 pg/ml of recombinant toxin for 4d. Relative enrichment of Vp cDNAs was 
analyzed from RNA of stimulated and re ting PBLs by anchored primer PCR and reverse dot blot to a panel of 17 
different Vp cDNAs. 



The values representing the highest Vp enrichment are underlined. 
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The human TcR VP specificity of the recombinant toxins was determined by 
multiprimer anchored PCR and dot blot analysis using a panel o£ 17 human V(3 DNA 
regions. The VP enrichment after stimulation with toxin was compared to the VP 
profile of unstimulated PBLs (Table 2). The sum total of all Vps stimulated by 
5 rSMEZ, rSMEZ-2 and rSPE-G was close to 100 % suggesting that the VPs used in 
the panel represent all the targeted Vps. On the other hand, the total of the VPs 
stimulated by rSPE-H was only 75%. It is therefore likely that rSPE-H also 
stimulated some less common VPs, which are not represented in the panel. The 
most dramatic response was seen with all toxins, except rSMEZ2, on Vp2.1 bearing 
10 T cells (21-fold for rSMEZ, 45-fold for rSPE-G and 22-fold for rSPE-H). In contrast, 
rSMEZ2 gave only a 2.5-fold increase of VP2.1 T-cells. SPE-G also targeted Vp4.1, 
Vp6.9, Vp9.1 and VP12.3 (3-4 fold). A moderate enrichment of VP12.6, Vp9.1 and 
VP23.1 (4-8 fold) was observed with rSPE-H. Both, rSMEZ and rSMEZ2, targeted 
VP4. 1 and VP8.1 with similiar efficiency (3-7-fold). This finding is of particular 
15 interest, because VP8. 1 activity had been found in some, but not all Str. pyogenes 
culture supernatants and in crude preparations of SPE-A and SPE-C. Moreover, 
SPE-B has often been claimed to have VP8 specific activity, but has since been 
shown to be a contaminant previously called SpeX. The ability of rSMEZ and 
rSMEZ-2 to stimulate the VP8.1 Jurkat cell line was tested (Fig. 5) Recombinant 
20 SMEZ was less potent than the control toxin (rSEE), showing a half maximal 
response of 0.2 ng/ml, compared to 0.08 ng/ml with rSEE, but rSMEZ-2 was more 
potent than rSEE (0.02 ng/ml). No proliferation activity was observed with the 
negative control toxin rSEA. 

MHC class II binding 

To determine if there were significant structural differences, the protein structures 
of SMEZ-2, SPE-G and SPE-H were modelled onto the superimposed structurally 
conserved regions of SEA, SEB and SPE-C. The models showed that in all three 
proteins, the 2 amino acid side chains of the COOH-terminal primary zinc binding 
motif are in close proximity to a third potential zinc ligand to build a zinc binding 
site, similar to the zinc binding site observed in SEA and SPE-C. 




35 



The zinc binding residues in SPE-C are H167, H201, D203, and it is thought that 
H81 from the HLA-DR1 P-chain binds to the same zinc cation to form a regular 
tetrahedral complex. The two ligands of the primary zinc binding motif, H201 and 



88860 WGN 




21 



D203, are located on the pl2 strand, which is part of the P~ grasp motif, a common 
structural domain of superantigens. The third ligand, H167, comes from the piO 
strand (Roussel et al. 1997). 

5 In the model of SPE-G three potential zinc binding ligands (H167, H202 and D204) 
are located at corresponding positions. In the SMEZ-2 and the SPE-H models, the 
two corresponding pi2 residues are H202, D204 and H198, D200, respectively. The 
third ligand in SPE-H (D160) and in SMEZ-2 (H162) comes from the P9 strand and is 
most similar to H187 in SEA. It has been shown from crystal structures that H167 
10 of SPE-C and H187 of SEA are spatially and geometrically equivalent sites (Scad et 
al. 1997, Embo J 14 no 14:3292-301; Roussel et al. 1997). 

All superantigens examined so far, except SPE-C, bind to a conserved motif in the 
MHC class II al-domain. In SEB and TSST, hydrophobic residues on the loop 

15 between the pi and P2 strand project into a hydrophobic depression in the MHCII 
al-domain. This loop region has changed its character in SPE-C, where the 
hydrophobic residues ( F44, L45, Y46 and F47 in SEB) are substituted by the less 
hydrophobic residues T33, T34 and H35. A comparison of this region on the 
computer generated models revealed that the generic HLA-DR1 a-chain binding site 

20 might also be missing. As the loop regions are generated by random choice, no 
conclusions can be made from their conformation in the models. However, in none 
of the three models does the pi-P2-loop have the required hydrophobic features 
observed in SEB and TSST Swaminathan, S. et al, Nature 359, No. 6398:801-6 
(1992), Acharya et al, Nature 367, No. 6458: 94-7 (1994). The residues are 125, D26, 

25 F27, K28, T29 and S30 in SMEZ-2, T31, T32, N33, S34 in SPE-G and K28, N29, S30, 
P31, D32, 133, V34 andT35 in SPE-H. 

SMEZ-2 differs from SMEZ in only 17 amino acids. In the model of SMEZ-2 with the 
position of those 17 residues, most of the exchanges are located on loop regions, 
30 most significantly on the p5~p6 loop with 5 consecutive residues replaced. The 
potential zinc binding site and the P1-P2 loop are not affected by the replaced amino 
acids. 

The TcR Vp specificity differs between SMEZ and SMEZ-2 by one Vp. SMEZ strongly 
35 stimulates Vp2 T cells, but SMEZ-2 does not (Table 2). One or more of the 17 
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exchanged residues in SMEZ/SMEZ-2 might therefore be directly involved in TcR 
binding. The exact position of the TcR binding site can not be,predicted from the 
model as several regions have been implicated in TcR binding for different toxins. 
Crystal structures of SEC2 and SEC3, complexed with a TcR 0-chain indicated the 
5 direct role of several residues located on a2, the 02-03 loop, the 04-05 loop and oc4 
(Fields et al. 1996 Nature 384 no 6605:188-92). On the other hand, binding of TSST 
to the TcR involves residues from a4, the {37-08 loop and the a4-09 loop (Acharya et 
al. 1994, Nature 367 no 6548:94-7). The SMEZ-2 model shows 3 residues, which 
might contribute to TcR binding. In SMEZ, Lys is exchanged for Glu at position 80 
10 and Thr is exchanged for lie at position 84, both on the 04-05 loop. On the COOH- 
terminal end of the a4 helix, Ala is replaced by Ser at position 143. 

The results from the computer modelled protein structures suggest that all 4 toxins, 
SMEZ, SMEZ-2, SPE-G and SPE-H, might bind to the HLA-DR1 0- chain in a zinc 
15 dependent fashion, similar to SEA and SPE-C, but might not be able to interact with 
the HLA-DR1 a-site, a situation that has so far only been observed with SPE-C 
(Roussel et al. 1997; Li et al. 1997). 

To find out whether or not zinc is required for binding of the toxins to MHC class II, 
20 a binding assay was performed using human LG-2 cells (which are MHC class II 
expressing cells homozygous for HLA-DR1). Direct binding of 125 I-labeled toxins was 
completely abolished in the presence of 1 mM EDTA (Fig. 6, Table 3). When 2 mM 
ZnCb was added, binding to the LG-2 cells could be restored completely. These 
results show that the toxins bind in a zinc dependent mode, most likely to the HLA- 
25 DR1 0-chain similar to SEA and SPE-C. However, it does yet not exclude the 
possibility of an additional binding to the HLA-DR1 a-chain. 




TABLE 3 



BINDING AFFINITIES AND ZINC DEPENDENCIES FOR SUPERANTIGENS TO 

HUMAN CLASS II 



TOXIN 


MHC CLASS II BINDING 

kd fnMl 


7ywp Tiir'pxr'N'T^Trwr^v 


SEA 


^fi / i nnn 

ouy iuuu 


++ 


SEB 


340 




TSST 


130 




SPE-C 


70 


++ 


SMEZ 


65/1000 


++ 


SMEZ-2 


25/1000 


++ 


SPE-G 


16/1000 


+-f- 


SPE-H 


37/2000 


++ 



The binding affinities of the toxins to MHC class II were determined by Scatchard analysis using LG-2 cells. 
Zinc dependency was determined by binding of recombinant toxins to LG-2 cells in the presence and absence of 
EDTA, as described in the Materials and Methods section. 

The biphasic binding of SEA to HLA-DR1 can be deduced from Scatchard analysis. It 
shows that SEA possesses a high affinity binding site of 36 nM (which is the zinc 
dependent P- chain binding site) and a low affinity binding site of 1 jaM (a-chain 
binding site). On the other hand, only one binding site for HLA-DR1 was deduced 
from Scatchard analysis with SEB, TSST and SPE-C, respectively (Table 3). 

Therefore, Scatchard analysis was performed with radiolabeled rSMEZ, rSMEZ-2, 
rSPE-G and rSPE-H using LG-2 cells. All four toxins showed multiphasic curves 
with at least 2 binding sites on LG-2 cells, a high affinity site of 15-65 nM and a low 
affinity site of 1-2 \xM (Fig. 7, Table 3). 
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In a further attempt to determine the orientation of the toxins on MHC class II 
competition binding experiments were performed. The recombinant toxins and 
reference toxins (rSEA, rSEB, rSPE-C and iTSST) were radiolabeled and tested with 
5 excess of unlabeled toxin for binding to LG-2 cells. The results are summarized in 
Fig. 8. Both, rSEA and rSPE-C, inhibited binding of labeled rSMEZ, rSMEZ-2, rSPE- 
G and rSPE-H, respectively. However, rSPE-C only partially inhibited binding (50%) 
of the labeled rSMEZ-2 {Fig. 9). Recombinant SEB did not compete with any other 
toxin, even at the highest concentration tested. Recombinant TSST was only 
10 slightly competitive against 125 I-labeled rSMEZ, rSMEZ-2 and rSPE-G, respectively, 
and did not inhibit rSPE-H binding at all. 

Reciprocal competition experiments were performed. Recombinant SMEZ, rSMEZ-2 
and fSPE-H prevented 125 I-rSEA from binding to LG-2 cells. However, only partial 
15 competition (50%) was observed even at the highest toxin concentrations (10,000 
fold molar excess). Recombinant SPE-G did not prevent binding of l25 I-rSEA and 
1251-xTSST binding was only partially inhibited by rSMEZ, rSMEZ-2 and rSPE-H, but 
not by rSPE-G. Significantly, none of the toxins inhibited 125 I-rSEB binding, even at 
the highest concentration tested. 

20 

In a further set of competition binding experiments, rSMEZ, rSMEZ-2, rSPE-G and 
rSPE-H were tested for competition against each other. Both, rSMEZ and rSMEZ-2 
competed equally with each other and also prevented binding of labeled rSPE-G and 
rSPE-H. In contrast, rSPE-G and rSPE-H did not inhibit any other toxin binding 
25 suggesting that these toxins , had the most restricted subset of MHC class II 
molecules, which represent specific receptors. 

The foregoing examples are illustrations of the invention. The invention may be 
carried out with the numerous variations and modifications as will be apparent to 
30 those skilled in the art. For example, a native superantigen may be replaced by a 
synthetic superantigen with on or more deletions, insertions and/or substitutions 
relative to the corresponding natural superantigen, provided that the superantigen 
activity is retained. Likewise there are many variations in the way in which the 
invention can be used in other aspects of it. 
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